Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

The estimation of the direction-of-arrival (DOA) of one or more acoustic sources is an area that has generated much interest in recent years, with applications like automatic video camera steering and multi-party stereophonic teleconferencing entering the market. Time-difference-of-arrival (TDOA) based methods compute each relative delay using only two microphones, even though additional microphones...

chapter

Multi-Channel Source Separation Preserving Spatial Information

Robert Aichner, Herbert Buchner, Meray Zourub, Walter Kellermann

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-5 - I-8

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

In this paper we propose two novel methods for preserving the spatial information in source separation algorithms. Our approach is applicable to any source separation algorithm and is based on an additional supervised adaptive filtering with the reference signals generated by the source separation system. If a special constrained optimization scheme is applied to derive the source separation algorithm...

chapter

Primary-Ambient Signal Decomposition and Vector-Based Localization for Spatial Audio Coding and Enhancement

M.M. Goodwin, J. Jot

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-9 - I-12

Spatial audio coding and enhancement address the growing commercial need to store and distribute multichannel audio and to render content optimally on arbitrary reproduction systems. In this paper, we discuss a spatial analysis-synthesis scheme which applies principal component analysis to an STFT-domain representation of the original audio to separate it into primary and ambient components, which...

chapter

Principles and Analysis of the Squeezing Approach to Low Bit Rate Spatial Audio Coding

Bin Cheng, Christian Ritz, Ian Burnett

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-13 - I-16

This paper presents a novel solution to multichannel spatial audio coding: Spatial Squeezing Surround Audio Coding (S3AC). The S3AC scheme analyses a multichannel audio signal and downmixes it into a stereo signal pair containing both the monophonic properties of audio sources and their localization information; this avoids the need for side information. The approach uses time-frequency analysis of...

chapter

Acoustic Echo Cancellation for Surround Sound using Perceptually Motivated Convergence Enhancement

Jurgen Herre, Herbert Buchner, Walter Kellermann

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-17 - I-20

Acoustic Echo Cancellation (AEC) has become an essential and well-known enabling technology for hands-free communication and human-machine interfaces. AEC for two or more reproduction channels aims at identifying the echo paths between the microphone and each audio reproduction source in order to cancel the associated echo contribution. A number of preprocessing methods have been proposed to decorrelate...

chapter

Efficient Conversion of X.Y Surround Sound Content to Binaural Head-Tracked Form for HRTF-Enabled Playback

Dmitry N. Zotkin, Ramani Duraiswami, Nail A. Gumerov

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-21 - I-24

Binaural presentation of X.Y sound is usually performed using virtual audio principles - that is, by attempting to virtually reproduce the setup of the X+Y loudspeakers in the reference room configuration. The computational cost of such playback is linear in the number of channels in the X.Y setup. We present a novel scheme that computes, offline, a spatio-temporal representation of the sound field...

chapter

An Acoustic MIMO Framework for Analyzing Microphone-Array Beamforming

Jingdong Chen, J. Benesty, Yiteng Huang

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-25 - I-28

Although a significant amount of research attention has been devoted to microphone-array beamforming, the performance of all the developed algorithms in practical acoustic environments is still far from meeting our expectation. So further research efforts on this topic are indispensable. In this paper, we treat a microphone array as a multiple-input multiple-output (MIMO) system and develop a general...

chapter

Microphone Array Post-Filter using Incremental Bayes Learning to Track the Spatial Distributions of Speech and Noise

M.L. Seltzer, I. Tashev, A. Acero

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-29 - I-32

While current post-filtering algorithms for microphone array applications can enhance beamformer output signals, they assume that the noise is either incoherent or diffuse, and make no allowances for point noise sources which may be strongly correlated across the microphones. In this paper, we present a novel post-filtering algorithm that alleviates this assumption by tracking the spatial as well...

chapter

Speech Source Separation by Combining Localization Cues with Mixture Models of Speech Spectra

K. Wilson

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-33 - I-36

We present a method for simultaneous speech source separation in reverberant environments using both localization cues and a speech model. Previous source separation work has focused primarily on one or the other of these approaches; we use a novel localization cue observation noise model to allow for a natural combination of the approaches. We model speech as a Gaussian mixture model (GMM) of short-time...

chapter

Source Localization in Reverberant Environments by Consistent Peak Selection

Raffaele Parisi, Albenzio Cirillo, Massimo Panella, Aurelio Uncini

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-37 - I-40

Acoustic source localization in the presence of reverberation is a difficult task. Conventional approaches, based on time delay estimation performed by generalized cross correlation (GCC) on a set of microphone pairs, followed by geometric triangulation, are often unsatisfactory. Prefiltering is usually adopted to reduce the spurious peaks due to reflections. In this work an alternative strategy is...

chapter

Blind Speech Separation in a Meeting Situation with Maximum SNR Beamformers

Shoko Araki, Hiroshi Sawada, Shoji Makino

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-41 - I-44

We propose a speech separation method for a meeting situation, where each speaker speaks only some of the time and the number of active speakers changes from moment to moment. Many source separation methods have already been proposed; however, they consider the case where all the speakers keep speaking, which is not always true in a real meeting. In such cases, in addition to separation, speech detection and the classification...

chapter

Efficient Blind Source Separation Combining Closed-Form Second-Order ICA and Nonclosed-Form Higher-Order ICA

Kentaro Tachibana, Hiroshi Saruwatari, Yoshimitsu Mori, Shigeki Miyabe, et al.

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-45 - I-48

In this paper, first, we propose a computational-cost efficient blind source separation combining closed-form 2nd-order independent component analysis (ICA) and nonclosed-form higher-order ICA. The closed-form solution of the 2nd-order ICA has been recently presented by one of the authors. This finding motivates us to combine the closed-form 2nd-order ICA and higher-order ICA, where the preceding...

chapter

All-Pole Spectral Envelope Modelling with Order Selection for Harmonic Signals

F. Villavicencio, A. Robel, X. Rodet

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-49 - I-52

We present a study into all-pole spectral envelope estimation for the case of harmonic signals. We address the problem of the selection of the model order and propose to make use of the fact that the spectral envelope is sampled by means of the harmonic structure to derive a reasonable choice for an appropriate model order. The experimental investigation uses synthetic ARMA featured signals with varying...

chapter

Analysis of Musical Instrument Sounds by Source-Filter-Decay Model

Anssi Klapuri

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-53 - I-56

This paper proposes a way of modelling the time-varying spectral energy distribution of musical instrument sounds. The model consists of an excitation signal, a body response filter, and a loss filter which implements a frequency-dependent decay. The three parts are further represented with a linear model which allows controlling the number of parameters involved. A method is proposed for estimating...

chapter

Integration and Adaptation of Harmonic and Inharmonic Models for Separating Polyphonic Musical Signals

K. Itoyama, M. Goto, K. Komatani, T. Ogata, et al.

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, vol. 1, pp. I-57 - I-60

This paper describes a sound source separation method for polyphonic sound mixtures of music to build an instrument equalizer for remixing multiple tracks separated from compact-disc recordings by changing the volume level of each track. Although such mixtures usually include both harmonic and inharmonic sounds, the difficulties in dealing with both types of sounds together have not been addressed...

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07

Author Index

Covers

Copyright page

ICASSP 2006 Conference Committee

Direction of Arrival Estimation using Eigenanalysis of the Parameterized Spatial Correlation Matrix
